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*USER IDENTIFIED SIGNIFICANT RESULTS 


ABSTRACI 


ORSER-SSEL Technical Report 18-75 

CANONICAL ANALYSIS AND IRANSFOKMaIICN OF SKYLAB MULTISPECTRAL SCANNER DATA 
D. M. Barr and B- F. Merembeck 

Initial wcrk with digital data trotn selected Skylab scenes resulted 
in the selection or sixteen channels (.eleven bands) and In the development 
of 22 signatures. The scenes were all from SL3, Orbit 14, 5 August 1973, 
Tape 933847. The objective in this study was to process the data by means 
of canonical analysis, with a view toward improvement of classification, 
reduction of data bulk, and determinaticn of the value of each band for 
classification of various targets 

Statistical Infcrmatlcn tor each of the 22 signatures was obtained 
from the STaIS program, and training areas for specific targets were de- 
fined- The new signatures, number of observations, and variance-covariance 
matrixes for the signatures were obtained and used as input to the CANAL 
program. The objective here was to maximize the separability of the 
categories (the transtcrmatlon will usually also reduce the within-category 
variance) - 

It was determined that the first three axes contained 98.83% of the 
variance contained within the transformation; hence, only the first three 
axes were used in the transformation. Thus, canonical transformation is a 
form of feature selection, resulting here in an 81.25% reduction in data 
bulk. 

Evaluation of the c.lasslf Ication, now possible using the transformed 


data, has begun 


CANONICAL ANA^YSi-S AND TRANSFORMATION 


OF SKYLAB MULTISPECTRAL SCANNER DATA 
D M Barr and B. F. Merembeck 


OBJECTIVES 


The initial resultb of channel evaluation and classification of 
selected Skylab scenes (ORSER-SSEL Technical Report 2-75) resulted in the 
selection of sixteen channels (eleven bands) of data and development of 
twenty-two spectral signatures (six categories). aIj. scenes were from 
SL3, Orbit 14, 5 August 1973, Taps 933847. Following these results, 
preparations were begun to piocess the multispectral scanner data using 
the program CANAL (canonical analysis) developed by the Office for Remote 
Sensing of Earth Resources (ORSER).^ This procedure, with the subsequent 
analysis followed by a tr ans format icr. of the data using the canonical 
transformation matrix generated by CANAL, was Intended to accomplish the 
fol lowing: 


1. improve the ciassillcaticr. ; 

2. reduce the bulk of the oata; and 

3. increase our knowledge ci tf.e importance of the information 
contained In each band f.r wiasslficatir?. of different targets. 


PROCEDURE 


The fust step in the piuctos was to obtain specific statistical infor- 
mation fer each of the 22 slgnaturts This necessitated selection of perma- 
nent training areas The train: ng areas for each signature were defined by 
producing classlf Icaticn maps using each of the 22 original signatures as a 
category and then selecting the areas in which cnoy chat signature was 
mapped. This was acccmpilshed for a.r tut one signature (#15-urban), which 
occurred in blocks too small tc be -ealist icaily defined as training areas. 

The remaining permanent training areas denned as above for the 21 
signatures weve then processed by the program STATS. The foiicwing infor- 
mation was obtained for each of the signatures; 

1. New signatures - These were obtained trjm the statistics of 
the permanent training areas, as was expected, these varied 
very ilttie tzom the criglnai. signatures used in the classifi- 
cation maps from which the training areas were selected. 


^See ORSER-SSEL lechr.icai. Reports 10-75 or ii-75 lor descriptions of 
ORSER program: 
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2. HisCograms for each band - These were to check for a normal 
distribution of each signature and to insure that the training 
areas were correctly defined. 

3. Variatice-cot^ar lance matrix of mean result of each channel - 
This IS an essential Input for CANAL. 

4. Number of observations In each training area - gam essential 
for CANAL. 

Once the signatures, number of observations, and variance-covariance 
matrix for each of the signatures were completed they were used as Input 
to the CANAL program. 

Canonical, or iiultlple discriminant, analysis generates a linear trans- 
formation matrix C*. The transformation Is of the form 


Ylj = C*X 


IJ 


where. In this case, Is a sixteen channel raw Skylab element and Y 
is a three axis transfo"?med element. The object is to maximize the ^ 
separability of the categories. This is done by maximizing the variance 
among categories subject to the constraint that the axes be orthogonal. 


In addition, the transformed elements must have unit variance within 
categories. This may or may not reduce the variance within the category 
relative to the untransformed data. For Skylab and most other remotely 
sensed data,i.he transformation will usually reduce within-category 
variance. To the extent that it occurs, this reduction of within-category 
variance, coupled with the theoretically predicted maximum in among- 
category variance, constitutes a partial classification of the data. 


RESULTS 


The original transf ormaticn matrix, C, had sixteen axes. However, the 
first three axes contained 98.83% of the variance contained within the 
transformation. The values for axis one, two, and three were 83.61%, 
14.49%, and 0.72%, respectively . ^'Therefore, only the first three axes were 
used in transformation C*. Used in this way, the canonical transformation 
is a form of feature selection, since it means that data having sixteen 
spectral values for each pixel Is transformed to data having only three 
values for each pixel. The result was an 81.25% reduction in data bulk. 

It Is expected that using transformed data for classification will result 
in significant reductions in computer cost. 

Table 1 ' ists the axls-to-channel correlation for the first three 
axes from the CANAT, run As can be seen, the highest correlations for axis 
1 occur fer chanueff 8, 9, 10, 19, 20, 12, and 17. Axis 2 (oithogonal to 
axis 1) has the highest axis-to-channel cci relation for channels 1, 2, 3, 

4, 5, 6, 12, 13, and 14, Finally, axis 3 (orthogonal to both axis 1 and 
axis 2) is highly correlated with channel 21 only. Although this axis 
contained only 0.72% of the total variance, it war decided to Include it 
In the transformation simply because of its high corvelotion to the single 
thermal band. 



Table 1: Axis-to-Channel Coiielacions Table 2: The Transformation Matilx, C* 
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Table 2 lists the cransf ormation natiix C* used In the transformation 
formula to transform the sixteen channels into three axes. 

Having developed the transformation matiix and analyzed the resulting 
co*'*'elations, work to eva^jate the ciassit icat ten, now possible using the 
transformed data, has begun. Hopefully, by knowing which channels are 
highly correlated to each axis and by observing the ability of the indi- 
vidual axis to map certain targets, we should get a better understanding 
of the importance of each individual band in mapping various target types. 
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